Search CORE

130 research outputs found

Efficient pebbling for list traversal synopses

Author: Matias Yossi
Porat Ely
Publication venue
Publication date: 01/01/2002
Field of study

We show how to support efficient back traversal in a unidirectional list, using small memory and with essentially no slowdown in forward steps. Using

O(\log n)

memory for a list of size

n

, the

i

'th back-step from the farthest point reached so far takes

O(\log i)

time in the worst case, while the overhead per forward step is at most

\epsilon

for arbitrary small constant

\epsilon>0

. An arbitrary sequence of forward and back steps is allowed. A full trade-off between memory usage and time per back-step is presented:

k

vs.

kn^{1/k}

and vice versa. Our algorithms are based on a novel pebbling technique which moves pebbles on a virtual binary, or

t

-ary, tree that can only be traversed in a pre-order fashion. The compact data structures used by the pebbling algorithms, called list traversal synopses, extend to general directed graphs, and have other interesting applications, including memory efficient hash-chain implementation. Perhaps the most surprising application is in showing that for any program, arbitrary rollback steps can be efficiently supported with small overhead in memory, and marginal overhead in its ordinary execution. More concretely: Let

P

be a program that runs for at most

T

steps, using memory of size

M

. Then, at the cost of recording the input used by the program, and increasing the memory by a factor of

O(\log T)

O(M \log T)

, the program

P

can be extended to support an arbitrary sequence of forward execution and rollback steps: the

i

'th rollback step takes

O(\log i)

time in the worst case, while forward steps take O(1) time in the worst case, and

1+\epsilon

amortized time per step.Comment: 27 page

arXiv.org e-Print Archive

CiteSeerX

The streaming $k$ -mismatch problem

Author: Clifford Raphaël
Kociumaka Tomasz
Porat Ely
Publication venue
Publication date: 09/04/2018
Field of study

We consider the streaming complexity of a fundamental task in approximate pattern matching: the

k

-mismatch problem. It asks to compute Hamming distances between a pattern of length

n

and all length-

n

substrings of a text for which the Hamming distance does not exceed a given threshold

k

. In our problem formulation, we report not only the Hamming distance but also, on demand, the full \emph{mismatch information}, that is the list of mismatched pairs of symbols and their indices. The twin challenges of streaming pattern matching derive from the need both to achieve small working space and also to guarantee that every arriving input symbol is processed quickly. We present a streaming algorithm for the

k

-mismatch problem which uses

O(k\log{n}\log\frac{n}{k})

bits of space and spends \ourcomplexity time on each symbol of the input stream, which consists of the pattern followed by the text. The running time almost matches the classic offline solution and the space usage is within a logarithmic factor of optimal. Our new algorithm therefore effectively resolves and also extends an open problem first posed in FOCS'09. En route to this solution, we also give a deterministic

O( k (\log \frac{n}{k} + \log |\Sigma|) )

-bit encoding of all the alignments with Hamming distance at most

k

of a length-

n

pattern within a text of length

O(n)

. This secondary result provides an optimal solution to a natural communication complexity problem which may be of independent interest.Comment: 27 page

arXiv.org e-Print Archive

Crossref

Explore Bristol Research

Sublinear Distance Labeling

Author: Alstrup Stephen
Porat Ely
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 24th Annual European Symposium on Algorithms (ESA 2016)
Publication date: 01/01/2016
Field of study

Dagstuhl Research Online Publication Server

A Simple Algorithm for Approximating the Text-To-Pattern Hamming Distance

Author: Kopelowitz Tsvi
Porat Ely
Publication venue: OASIcs - OpenAccess Series in Informatics. 1st Symposium on Simplicity in Algorithms (SOSA 2018)
Publication date: 01/01/2018
Field of study

The algorithmic task of computing the Hamming distance between a given pattern of length m and each location in a text of length n, both over a general alphabet Sigma, is one of the most fundamental algorithmic tasks in string algorithms. The fastest known runtime for exact computation is tilde O(nsqrt m). We recently introduced a complicated randomized algorithm for obtaining a (1 +/- eps) approximation for each location in the text in O( (n/eps) log(1/eps) log n log m log |Sigma|) total time, breaking a barrier that stood for 22 years. In this paper, we introduce an elementary and simple randomized algorithm that takes O((n/eps) log n log m) time

Dagstuhl Research Online Publication Server

Explicit Non-Adaptive Combinatorial Group Testing Schemes

Author: Porat Ely
Rothschild Amir
Publication venue
Publication date: 29/04/2008
Field of study

Group testing is a long studied problem in combinatorics: A small set of

r

ill people should be identified out of the whole (

n

people) by using only queries (tests) of the form "Does set X contain an ill human?". In this paper we provide an explicit construction of a testing scheme which is better (smaller) than any known explicit construction. This scheme has \bigT{\min[r^2 \ln n,n]} tests which is as many as the best non-explicit schemes have. In our construction we use a fact that may have a value by its own right: Linear error-correction codes with parameters

[m,k,\delta m]_q

meeting the Gilbert-Varshamov bound may be constructed quite efficiently, in \bigT{q^km} time.Comment: 15 pages, accepted to ICALP 200

arXiv.org e-Print Archive

CiteSeerX

Dagstuhl Research Online Publication Server